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Abstract. Let the random variable Z n ^ denote the number of increasing subse- 
quences of length k in a random permutation from S n , the symmetric group of 
permutations of {l,...,n}. We show that Var(Z n ^ n ) = o((EZ n ^ n ) 2 ) as n — > oo if 

and only if k n = o(ns ). In particular then, the weak law of large numbers holds for 
^ ■ z n,k n if k n = o(n5); that is, 

O ' 

Olim '—H— = 1, in probability, 
moo RZ„ >„ 



We also show the following approximation result for the uniform measure U n on Sn 
Define the probability measure /J. n -fe n on S n by 
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n;x lt X2,...x kn > 

<x 2 <...<a; fc 



where U n ;x 1 ,x2,...,x k denotes the uniform measure on the subset of permutations 
which contain the increasing subsequence {xi, X2, Then the weak law of 

large numbers holds for Z nkn if and only if 

(*) lim \\ l x n . kn -U n \\=Q, 

n — >oo 

where || • || denotes the total variation norm. In particular then, (*) holds if k n = 

2 

o(n5 ). 

In order to evaluate the asymptotic behavior of the second moment, we need to 
analyze occupation times of certain conditioned two-dimensional random walks. 
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1. Introduction and Statement of Results. Let S n denote the symmetric 
group of permutations of {1, n}. By introducing the uniform probability measure 
U n on S n , one can consider a G S n as a random permutation. Probabilities and 
expectations according to U n will frequently be denoted by the generic notation 
P and E respectively. The problem of analyzing the distribution of the length, 
L n , of the longest increasing subsequence in a random permutation from S n has 
a long and distinguished history; see [1] and references therein. In particular, the 
work of Logan and Shepp [6] together with that of Vershik and Kerov [8] show that 

EL n ~ 2n^ and that a 2 (L n ) = o(n), as n — > oo. Profound recent work by Baik, 

i 

Deift and Johansson [2] has shown that lim n ^ 00 P( L "~| n!? < x) = F(x), where F 

is an explicitly identifiable function. 

There doesn't seem to be any literature on the random variable Z n ^ = Z n ^(a), 

which we define to be the number of increasing subsequences of length k in a 

f 1 2 3 4 5 \ 

permutation a G S n . Thus, for example, if a = I . I , then Z^^{a) = 



x l 3 4 5 2 / 

4 because there are four increasing subsequences of length three; namely, 134, 
135, 145 and 345. It is useful to represent Z n ^ as a sum of indicator random 
variables. For positive integers {x\, ...,£&} satisfying 1 < x\ < < ... < Xk < n, 
let B™ i Xk C S n denote the subset of permutations which contain the increasing 
subsequence {xi,X2, ...,Xk}- Then we have 



a;i<a;2<---<a;fc 



where the sum is over the (^) distinct increasing subsequences of length k. Since 
the probability that a random permutation fixes any particular increasing sequence 
of length k is ^ , it follows that the expected value of is given by 

(i.i) £Z n , fc = ^f- 



One can consider to depend on n in which case we write k n . We are interested 
in a law of large numbers of the form / £^ ,fc " — > 1 in probability, for appropriate 



choices of k n . Of course, in light of the above cited works on the longest increasing 
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subsequence, such a result cannot hold for k n > cn^ with c > 2. A straightforward 
calculation using Stirling's formula shows that 



ez x-^-y* 

n ' cn2 27TC712 C 



1 

(-) 2 n 1 2< , as n -> oo, for / e (0, -); 



i 

r cn2 — 1 , 



(For the case fc n = cna, we have used the fact that lim n ^ 00 IIj=o (-*- — n) = 

2 

exp(— -y), which is proved by taking the logarithm of the above product. Note that 
the factor exp(— ^-) suddenly appears in the formula when I = |.) In particular 
then, it follows from (1.2) that lim^^oo EZ n ^ n = oo, if k n < cnh with c < e, and 
lim^oo EZ n ^ kn = 0, if k n > emh . 

The law of large numbers for Z n ^ n is in fact equivalent to a certain approximation 
result for the uniform measure, which we now describe. Recall that for probability 
measures P\ and P2 on S n , the total variation norm is defined by 

\\P 1 -P 2 \\=u^{P 1 {A)-P 2 {A)) = \ £ {P^a) - P 2 (a)\. 

For x\ < x 2 < ■•• < Xk n , let U n;XltX2tmmm>Xk denote the uniform measure on permuta- 
tions which have {x\, x 2 , as an increasing sequence; that is U n - XljX2t .„ >Xk is 
uniform on 5™ ^...^ • Note that U n ; Xl , Xa ,..., Xkn is defined by U n . XltXa ,... jXkn (<r) = 
^1b^ X2 Xk (cr). Now define the probability measure /x n; fc„ on S n by 



A t n;fc„ — / n \ U n;Xl ,X2,...x kr 
\k n ) x 1 <X2<---<x kn 



Equivalently, 



1 ! 

The measure /U n; fc„ can be realized concretely as follows. Consider n cards, num- 
bered from 1 to n, and laid out on a table from left to right in increasing order. 
Place a black mark on k n of the cards, chosen at random. Pick up all the cards 
without black marks and then randomly insert them between the k n cards with 

black marks that remain on the table. The resulting distribution is n n -,k n - 
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Proposition 1. The law of large numbers holds for Z n ^ n ; that is 

lim — — 1 " = 1 in probability, 

n^oo EZ„ h 

if and only if 

(1-4) lim \\/J, n . kn -U n \\ = 0. 

The proof of Proposition 1 appears at the end of this section. 

The measure fJ> n ;k n corresponds to ignoring a set of k n random cards and ran- 
domizing the rest of the cards. How many random cards can one afford to ignore 
like this and maintain asymptotic randomness? Corollary 2 below shows that one 

2 

can afford to ignore k n — o(ti 5 ) cards, while the results cited above on the longest 
increasing subsequence show that one certainly cannot afford to ignore cna cards 
for c > 2. 

For the law of large numbers we will use Chebyshev's inequality. The calcu- 
lation of the second moment is nontrivial because it involves expectations of the 
form Elnn 1r" , and these expectations depend rather intimately on 

the relative positions of {xi,X2, ....,Xk n } and {yi,y2, We begin with the 

explicit form of the second moment of Z n ^ for any k < n. 

Proposition 2. 

where 

V J l r =N r =° K y 
V J ' m r =N 

In order to evaluate the asymptotic behavior of Yox{Z n ^ n ) , one must be able 

to adequately evaluate the asymptotic behavior of A(k n — In fact, it turns 

out that we need a good lower bound for A(k n — 1,1) and a good upper bound for 
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A(k n — j,j), for all j = 1, 2, ...k n . We were able to interpret A ( i^'J) as the sum of 

( N ) 

certain expected occupation times of the horizontal axis for the standard, simple, 
symmetric two-dimensional random walk starting from the origin and conditioned 
on returning to the origin at the 27V-th step. This characterization was sufficient 
to obtain the appropriate bounds to prove the following theorem. 

Theorem 1. 

i. If k n = o(ns), then 

5 

Var(Z n u ) ~,kn . 
(EZ Ujkn ) n 

In particular then, Var(Z njkri ) = o((E Z n ^ kn ) 2 ) , as n — > oo. 

2 2 

ii. If cins < k n < for constants ci,C2 > 0, t/ien 

c 3 (EZ njkn ) 2 < Var(Z n , kn ) < c 4 (EZ njkn ) 2 , 
for constants C3, C4 > 0. 

2 1 

m. 7/ lim n ^. 00 n~^k n = 00 and limsup n ^ oc n~2 fc n < 00, then 

Var(Z n , kn ) 
lim — — ^ — h^- = 00. 

n^oo (£Z n , fc J 2 

Corollary 1. i. If k n = o{n$), then 

lim — — 1 " =1, in probability, 



ii. If k n = 0(ns), then 



liminfP( n,kn > 5) > 0, for some 5 > 0. 

n^oo EZ n u 



Part (i) of Corollary 1 follows immediately from Chebyshev's inequality and 
Theorem 1-i. The proof of part (ii) of Corollary 1 appears below. 

Corollary 1 and Proposition 1 yield immediately the following approximation 
result. 



Corollary 2. If k n = o(ns), then 



lim \\fl n -,k n -U n \\= 0. 

n^oo 

In light of the above results, we pose the following question: 
Open Question: Presumably there exists a critical exponent l c such that the law 
of large numbers holds for Z nn i with I < l c and does not hold for I > l c . What is 

lc? 

In section two we prove Proposition 2 and in section 3 we prove Theorem 1. 
Lemmas 2 and 3, which appear in section 3 and give the key estimates on A(N,j) 
used in the proof of Theorem 1, are proved in section four. 

The literature on increasing subsequences in random permutations in a context 
other than that of the largest such subsequence is very scarce. The random variable 
Z n , defined as the total number of increasing subsequences of all possible lengths in 
a random permutation, was studied in [5]. Both EZ n and Var(Z n ) were calculated 
explicitly and evaluated asymptotically. It turns out that Var(Z n ) is of a larger 
order than (EZ n ) 2 , so it is not possible to apply Chebyshev's inequality and obtain a 
law of large numbers. However, the authors were able to show that x" — converges 
in probability and in mean to a positive constant. In [3], the random variable Z n ^ 
actually appears in a different guise. Equation (1.1) appears there as well as an 
upper bound for EZ i ; however, this random variable is not the object of study 
in that paper. In [7], inversions-which are decreasing subsequences of length 2 — are 
studied, and a central limit theorem is proved. 

We conclude this section with the proofs of Corollary 1-ii and Proposition 1. 

Proof of Corollary 1-ii. Assume to the contrary that the result is not true. Then 

there exists a subsequence {{n i ,k ni )}^ 1 of {(n, k n )}^L 1: such that F "'' fc "» goes 
to in probability. By taking a further subsequence if necessary, we may assume 

2 _ 2 

that either k ni = o(nf) as i — > oo, or lim^oo n i 5 k ni = c > 0. In light of part 



(i), we obtain a contradiction in the former case. Thus, it remains to consider the 



latter case. In this case, it follows from Theorem 1-ii that tt — ,Z' — T is bounded 
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away from and oo. Using this along with the assumption that ni > in 



probability, we conclude that 

Zr, . U — E Z tl . U 

(1.6) lim P( —. n * " n < < -p) = 1, 

for some p > 0. However, since the second moments of the — = = are 

v /Var(Z„., fe „.) 

equal to 1, this quotient is uniformly bounded. The uniform boundedness along 
with (1.6) contradict the fact that the first moment of lrr = is 0. □ 

Proof of Proposition 1. For e G (0, 1), define 

^ = {aGS B :^6[l-£,l + £ ]}. 
We claim that (1.4) holds if and only if 

(1.7) lim U n {D c n ^ kn ) = 0, foralUX). 

We first show the sufficiency of (1.7). Since lim n ^ 00 U n {D n ^ kn ) = 1, it follows 
from the definition of -D n ,e,fc„ that limin^^oo /i n; fc n (-D n ,e,fc„) > 1 ~~ e ? an d thus 

(1-8) lim sup (D£ >eifc J < e. 

n^oo 

Thus, for any C 5' n , we have 

\U n (A n ) - /j, n . kn (A n )\ < \U n (A n n A»,e,fcJ - fi n . kn (A n n A», e ,fcJI 

(1.9) 

+ |t/ n (A n n D£, e , fc J - Mn ;fc „(A n n D c n ^ k J\. 

By the definition of D nj6jkn , the first term on the right hand side of (1.9) is no 
greater than e. By (1.7) and (1.8), the lim sup of the second term on the right hand 
side of (1.9) is no greater than e. Since e > is arbitrary, we conclude that 

lim \U n (A n ) - ii„-k n {A„)\ = 0. 

Since the sets {A n } are arbitrary, this proves (1.4). 
We now show the necessity of (1.7). Let 

C n ,e,k n ={a e S n : ^ffj^r- < 1 - e}. 
7 



If (1.7) does not hold, then we may assume without loss of generality that there 
exists a 5 > and an eo > such that U n (C n>e0j k n ) > ^ f° r all n - But then, from 
the definition of C n , eo , kn , it follows that fx n - kn (C n ,e ,fcJ < (1 - ^o)U n {C n , eo , kn ), and 
thus |t/n(C n>e0l fc„) - fJtn;k n (C nje0t k n )\ > e S, for all n, which shows that (1.4) does 
not hold. 

To complete the proof of the proposition then, it remains to prove that (1.7) 
holds if and only if the law of large numbers holds. Using (1.3) for the first equality 
below, and using (1.1) for the second equality, we have 

1 kj „ jr l-£ 1 + e, 

(1.10) 



Un(D^ k J = P(j^^Z n , kn t [_, _]) 
\k n ) 



& Z n,k n 

From (1.10), it follows that (1.7) holds if and only if the law of large numbers holds 

for Z n ^ kn . □ 

2. Proof of Proposition 2. From the definition of Z n k , it follows that 



(2.1) EZ k = \ EIb™ Ib™ 



where the sum is over the (™) 2 pairs 5^ X2 Xfe , ya yfe with x x < x 2 < ... < x k 
and yi < yo < ... < y k . It turns out that Eln™ 1r» depends rather 

intimately on the relative positions of {xi,x 2 , ■ ■■jXk} an d {yi,y 2 , ■■■,Hk\- 

Let j G {0,1,..., fc}. For any particular subset A C {1, 2, n} satisfying |A| = 
2/c-j, there are ( 2fc ~ J ) ( 2 Jlf ) ordered pairs of sets B% M _ >Xk , B£ llV2l ... ltffc for which 
{x\,X2, ...,x k } U {7/1,7/2, ■■■iVk} = A. Of course, it follows that |{xi, x 2 , x k } n 
{yi,J/2,-,yfc}| = j- We will say that such a pair -B™ liX2i ..., Xfe , -B™,^,...,^ corre- 
sponds to A. For any pair B x l iX2 ^ Xk ,B yiy2 ^ yk corresponding to A, there exist 
numbers {l r } J r= o and {m r } J r=Q such that exactly Zo elements of {x\,X2, ■■■,x k } and 
mo elements of {yi,y2, J/fe} strictly precede the first element that is common to 
{xi, #2, a^fc} and {yi, 7/2, ...,y k }, exactly l r elements of {x\, X2, x k } and m r ele- 
ments of {j/i, 7/2) •••) y/c} fall strictly between the r-th and the (r + l)-th element that 

is common to {xi, x 2 , x k } and {t/i, 7/2, y k }, for r = 1, 2, j — 1, and exactly 
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Ij elements of {x\,X2, ■ and rrij elements of {yi,y2, Uk} strictly follow the 

j'-th and final element that is common to {xi, X2, x^} and {y±, 1/2, y k }- We 
will refer to the numbers {l r } J r=0 and {m r } J r=0 as the "interlacing numbers" for the 
pair B Xi X2 ^ Xk , B yiy2 ^ yk . 

Lemma 1. Let the pair B XlX2 ^ Xk ,B yiy2 ^ yk satisfy 

\{xi,X2,...,x k }n{y 1 ,y 2 ,...,yk}\ =j 
and let {l r } J r= o, { m rY r =o be the corresponding interlacing numbers. Then 

(2-2) E1 B , l Bn =-^--,17%^^. 

Proof. Without loss of generality, we may assume that n = 2k — j, since only the 
relative positions of the 2k — j distinct points in the set {x±, X2, x k , yi, j/2> •••> yk} 
are relevant. Thus, we are considering permutations from S 2 k-j- For each r = 
0, 1, j, consider the l r + m r positions between common elements (of course, for 
r = and r = j, "between" is not the correct word). There are (l r + m r )\ ways 
to fill these positions. However, if we require that the l r positions reserved for the 
x-chain and the m r positions reserved for the ?/-chain be in increasing order, this 
reduces the number of ways to ^fr^r ■ Thus, there are Yll-n ^4^r I r ways to 
fill all the positions so that B™ l X Xk fl B™ y yk will occur, and of course, all 
together there are (2k — j)! ways to fill the positions with no restrictions. □ 
We now complete the proof of the proposition. Simple combinatorial considera- 
tions show that out of the ( „~ J ') ( fel^ 7 ) pairs B XiX2 ^ Xk , B yiy2 ^ yk corresponding 
to a set A satisfying | A\ = 2k - j, there are U^ (^t^) = Ili=o ^JXf' of them 
with the interlacing numbers {l r } J r= o, { m r} J r= o- Using this fact along with (2.1), 
(2.2) and the fact that there are { 2 k-j) distinct subsets A C {1,2, ...,n} such that 
\A\ = 2k — j, we obtain the formula for EZ% k in Proposition 2. □ 

3. Proof of Theorem 1. Similar to (2.1), we can write the variance of Z n ^ n in 

the form 

(3.1) 

Var(Z n k )= > Els* 1b™ — > EIb™ EIb™ , 
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where the sum is over the (™) 2 pairs /^ ; .,,, ^ ; . ;/:! ,„ n - By (2.2), 

Elfin 1 fin — Elfin Elfin = 0, if 

x 1^2'- -^k n W1.W2. ■■■>»*;„ a! l. a; 2.---. ! > : fc n Wl. f2. ■■.Vfc„ 

(3.2) 

{ari,X2 J ...,xjfc n } and y 2 , are disjoint. 
The number of pairs {xi, X2, ...,Xk n }, {yi, yi, y&„} which are not disjoint is equal 
t° (/T) 2 — (fc n )( n fc fc ™) - ^ k n = o(n^), then a simple calculation reveals that 

a n j 2 -( fc n jr^)=-(( fc n j 2 )-Thus, 



(3.3) 



EE1 Rn £"1 
a: 1 ,X2,...,a: fcm 

{x 1 ,x 2 ,...,x kn }n{y 1 ,y 2 ,...,y kn }^0 



B™ 

a: 1 ,X2,...,a: fc H1 , y 2 , ■ ■ ■ , y k 



n 



2 



n \ fn - k n \ \ 1 . (r ) 



2 



o(^) = o((i?Z n , fc J 2 ), 



s ^n/ \knj \ k n J J (/c n !) 2 (^n-) 2 

where the final equality follows from (1.1). On the other hand, if it is not true that 
k n = 0(712), then the left hand side of (3.3) will be 0((EZ njkn ) 2 ). In light of this 
last remark along with (3.1)-(3.3), the theorem will be proved once we show that 

{x 1 ,x 2 ,...,x kn }n{y 1 ,y 2 ,...,y kn }^0 B x 1} x 2 ,...,x kn B y 1 ,y 2 ,--.,y kn _ f)(^n s 

(3.4-a) ' " {EZ n , kn f ~~ " n U 

if k n is as in part (i); 

EE 1 1 
{x 1 ,x 2 ,...,x kn }n{y 1 ,y 2 ,...,y kn }^0 B x 1 ,x 2 ,...,x kn B y 1 ,y 2 ,...,y krl 

(3.4-b) {!■:/.,,>,,)- ^ ' 

is bounded from and oo if k n is as in part (ii); 

EE~\ 1 
{xi,x 2 ,...,x fc „}n{yi,y2,---,yfe„}/0 B x 1: x 2 ,...,x k B y\,y 2 ,-..,y kn 
lim t— — — — = OO, 

(3.4-c) n^oo (EZ njkn ) 2 
if k n is as in part (iii). 
By Proposition 2 and its proof, it follows that 

E Elfin 1/3™ 
x 1 ,x 2 ,...,x kn y\ ,y 2 ,.. . ,yfc n 

{x 1 ,x 2 ,...,x kn }n{y 1 ,y 2 ,...,y kn }^0 

kn ( n \ 1 
= > I , 1 1—, TiMkn — j, j), where A(N, j) is as in (1.5). 

pi\^kn-jj (2kn-j)\ ^ J,Jh ^ ,J> K ' 

( ™ ) 

Using this with (3.4) and the fact that EZ Ujkn = yf, the proof will be complete 

if we show that 

(3.5-a) 
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< 3 - 6 - b > iif t= 

if k n is as in part (ii) ; 

(3.5-c) 



kn f ft \ 1 

^ ( oi ■ J rr:A(k n - j,j) is bounded from and 

\2k n - j J (2k n -j)\ 



oo 



(r) 2 ^V2*,.-.7(2fc„-i)! 



It remains therefore to analyze the left hand side of (3.5). In the next section 
we will prove the following key estimates: 

Lemma 2. For each p G (0, oo), there exists a constant C p > such that 

A(NJ)<C> -^-(2N)i^ , forj,N>l and ^ < p. 
In particular, since A(N,j) is increasing in N, one has 

A(k-jJ)<A(kJ)<C(^^(2k)i^ , forj,k>landj<k. 

Lemma 3. There exists a constant C > such that 

i /9k — 9^ 2 
A(k-l,l)>C(2k-2)*' ~ 



k-1 

We now use Lemma 2 to show that (3.5-a) and the part of (3.5-b) concerning 
boundedness from oo hold. Afterwards, we will use Lemma 3 to show that (3.5-c) 
and the part of (3.5-b) concerning boundedness from hold. 

In light of Lemma 2, it suffices to show that (3.5-a) and the part of (3.5-b) con- 
cerning boundedness from oo hold with A(k n — j, j) replaced by C{ r /j+i (2k n ) i ( 2 fc fc ™) • 
Letting 



fl(n '^ )= ^U-iJ^^i(^j (2 ^ 

it follows that (3.5-a) (respectively the part of (3.5-b) concerning boundedness from 
oo) will hold if we show that 



3=1 { 2 ) 



11 



5 

is O (-£-), if k n is as in part (i) (respectively, bounded if k n is as in part (ii)). 
Simplifying and making some cancellations, we have 

We have 

for positive constants b±, 62- (For the lower bound, we have used the fact that k n 
is of an order not larger than . The upper bound holds as long as k n < cn for 
some c < 1.) We also have 

(3.8) k n < { ^ n }\ ){ < (2k n ) j , J = 1, K. 
From (3.6)-(3.8) we have 

(3.9) B^KJ)^^ < b 2 n-i(2k n )V (2k n )^ < J-—^- 1 ^, 

. 1 j 

for some C > 0. Since J27Li i/j+i < 00, it follows from (3.9) that 



5 
k 2 



0. The term in (3.5-b,c) corresponding to j = 1 is [ k ^% {2^-1) (2k 



is O(-f-) as n — > 00, if k n is as in part (i), and is bounded if k n is as in part (ii). 
This proves (3.5-a) and the part of (3.5-b) concerning boundedness from 00. 

We now turn to (3.5-c) and the part of (3.5-b) concerning boundedness from 

(fc n !) 2 

\2k 

1,1). Define C(n, k n ) = ( 2 Z- 1) W^Ty.(^ n - 2)* ( 2 *;_- 2 ) 2 . Using the bound 
on A(k n — 1,1) from Lemma 3, it follows that for the part of (3.5-b) concerning 
boundedness from 0, it is enough to show that limin^^oo C(n, k n ) > 0, when k n 
is as in part (ii), and for (3.5-c) it is enough to show that lim^^oo C(n, k n ) = 00, 
when k n is as in part (iii). Simplifying and making some cancellations, we have 
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Using this with (3.7) gives 



C(n,k n ) > b 1 n- 1 kt l (2k n -l)- 2 (2k n -2)?. 

Thus, the above stated inequalities indeed hold. □ 
4. Proofs of Lemmas 2 and 3. 

Proof of Lemma 2. The first step of the proof is to develop a probabilistic repre- 
sentation for A(N,j). Fix j > 1 and N > 1. Consider two rows each containing 2N 
spaces. Randomly fill each of the two rows with N blue balls and N white balls. 
Define Xq = 0, and then for m = 1, 2, 2iV use the balls in the first row to define 
X m as the number of blue balls in the first m spaces minus the number of white balls 
in the first m spaces. Define Y m the same way using the balls in the second row. 
Then {X m } and {Y m } are independent, and as is well known, each one has the 
distribution of the simple, symmetric one-dimensional random walk, conditioned 
to return to at the 2iV-th step. Let U m = X ™+ Y ™ and V m = X ™~ Y ™ . Then 
(U m ,V m ) has the distribution of the standard, simple, symmetric two-dimensional 
random walk (jumping one unit in each of the four possible directions with probabil- 
ity j), starting from the origin and conditioned to return to the origin at the 2iV-th 
step. To see this, let {X m } and {y m } be independent copies of the unconditioned, 
simple, symmetric one-dimensional random walk starting from the origin, and let 
U m _ x m +y m anc j _ x m -y m _ r r j ien c i ear i y {Um.Vm} is the unconditioned, 

simple, symmetric two-dimensional random walk starting from the origin. Now 
{Um.Vjn} is equal to {U m ,V m } conditioned on X 2 n = 3^2Ar = 0, or equivalently, 
conditioned on U2N = V2N = 0. 

The total number of possible ways of placing iV blue balls and iV white balls in 

N J . For the 

moment, fix a set {s r } J r=0 of j + 1 nonnegative integers satisfying X!r=o Sr = ■ 

Let t r = YH=o s i- Let D so,s!,..., Sj denote the event {V to = V tl = ... = V tj = 0} = 

{X to = Y to ,X tl = Y tl ,...,X t . = Y t .}. Now for any sequence {Z r }J=o satisfying 

l r < s r and Xlr=o ^ r = ^ ne probability of the event {X tr = Yli=o(h ~ ( s « — 

13 



ZO), r = 0,1 j} = = ELo(2/, - *), r = 0,1, ...J} is (?) ft), 

and thus the probability of the event {X tr = Y tr = J2l =0 (2h — Si), r = 0, 1, j} 
* s Cn) (rir=o (r)) 2 - Summing now over all possible {l r } J r= o as above, it follows 
that 

i i / \ 2 



^0,^)=^ e n v/ 

l AT J ^ ir=N r=0 V 

Z_/ r =0 

Letting m r = s r — l r > 0, one sees that the term involving the summation on the 
right hand side of (4.1) can be written as Yl Yli=o (llmr) ' where the sum is 
over all {l r } J r=0 an ^ { m r}r=o satisfying J2 J r =o ^ = EJ=o mr = an< ^ lr + m r = s r , 
for r = 0, 1, Thus, summing (4.1) over all the possible choices of {<s r }^ =0 , we 
obtain 

(4.2) V P(^„„ .,) = ^ 



2JV 



/27V\ 2 
I TV J 



The next step of the proof is to estimate P(£) SOjSlj ... jS .). For this we will need 
several lemmas. 

Lemma 4. Le£ {Z n }^L ^ e one- dimensional random walk which takes jumps of 
±1 with probability | eac/i, and remains in its place with probability \ . Then there 
exit constants C\ , C<i > such that 

-4 < P(Z n = 0\Z = 0) < -4, /or n > 1. 
Proof. A direct calculation gives 

[-] 

p(^ = o|^ = o) = B5)-*(j)*(")("7 < )- 

We rewrite this as 
(4.3) 

P( Zn = o,z = o) = £§i = (i)-E(^Q ft 

14 



By Stirling's approximation, there exist positive constants c\, c 2 such that 



Thus, from (4.3) and (4.4) we have 

[— ] ["] 

Now let SVi be a random variable distributed according to Binom(n, -|). Then we 
have 

[-] 

(«) t 7! L=( 2 ".)(ir = £ (is„ + i)-i i {s _>. 

By standard large deviations estimates, P(\ ^ — \\ > e) decays exponentially in 
n for each e > 0. Using this along with (4.5) and (4.6) and leaving to the reader 
the little argument to accommodate the requirement in (4.6) that S n be even, we 
conclude that there exist constants C\ , C2 > such that 

( ' ! < P(Z n = o\z = 0) < ( 2 



□ 

Lemma 5. Let {Z n }^ =0 be a simple, symmetric one- dimensional random walk. 

i. There exists a constant Co > such that 

C a 2 

P(Z n = 0\Zq = a) < —= exp( ), for all a G Z and all n > 1. 

2n 

ii. Let L > 0. There exists a constant cl > such that for all sufficiently large n, 

c 0? 

P(Z 2n = 0\Zo = 2a) > —= exp( ), for all a G Z satisfying \a\ < Lrv 1 . 

^Jn n 

Proof. The lemma follows from the local central limit theorem. It can be proved 
via a direct calculation, using Stirling's approximation. (See, for example, [4, page 
65].) □ 

Lemma 6. Let {X n ,Y n }^L be a simple, symmetric two-dimensional random walk. 

15 



i. There exist constants c\ , c 2 > such that 

2 

P((X n ,Y n ) = (0,0)|(X ,y ) = (o,0)) < — exp(-— ), /or alloG Z and a//n > 1. 

77i 



u. There exists a constant C3 > suc/i t/iat 

P((^ 2n ,y 2n ) = (0,0)|(X o ,lb) = (0,0)) > ^, /or all n > 1. 

n 

Proof. Let and V n denote respectively the number of horizontal and the number 
of vertical steps made by the random walk {(X.,Y.)} during its first n steps. Then 
we have 

p((x n ,Y n ) = (o,o)|(x ,y ) = M)) = 

(4 ' 7) P(Zj = 0\Z Q = a)P(Z k = 0\Z o = 0)xP(H n = j,V n = k), 

j+k=n 

where {Z n } is as in Lemma 5. Since H n and V n are each distributed like Binom(n, |), 
a standard large deviations estimate gives 

(4.8) P(H n > X -n,V n >-n)>\ - I e xp(-6V), 

for some C > 0. Since exp(— |j) < ^=exp(— |^-), for |n < j < n, it follows 
from (4.7), (4.8) and Lemma 5-i that 

AC 2 a 2 1 

(4.9) P((X n ,Y n ) = (0,0)|(X ,y ) = (a,0)) < — ^ exp( ) + -exp(-Cn). 

n no 

Choosing ci sufficiently large and c 2 > sufficiently small, part (i) follows from 
(4.9) along with the fact that we need only consider \a\ < n. 
For part (ii), note that 

P((X 2n ,Y 2n ) = (0,0)\(Xo,%) = (0,0)) = 

P(Z2j = 0\Z = 0)P(Z 2k = 0\Z = 0) x P(H n = 2j, V n = 2k). 

2j+2fc=2n 

Also, we have P(H n and V n are even, H n > \n,V n > \n) > C, for some C > 

independent of n. Finally, P(Z 2 j = 0\Z = 0) can be bounded from below as in 

Lemma 5-ii. Part (ii) follows from these observations. □ 
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We can now estimate P(-D S0)Slj ... )S ■)• 
Lemma 7. Let s r = s r + 1 . There exists a constant c > snc/i £/ia£ 

(4.10) P(D SOiSl _ Sj ) < (2N+l)^ +1 (sos 1 ...s j )-^. 

Proof. Let {X n ,Y n }^ =Q be a simple, symmetric two-dimensional random walk. 

Recalling that tj = 2iV, it follows by definition that 

(4.11) 

P(D S0)S1 ,..., S ,) = P(Y t0 =Y tl = --- = Y t ._ x = 0\X = Y = X 2N = Y 2N = 0). 

By the Markov property, we have 

P(Y t0 =Y tl = --- = = 0|X = % = X 2N = Y 2N = 0) = 

P(Y t0 = 0\X = Y = 0) • P(Y tl = 0\Y to = X = % = 0) x • • -x 

(4.12) . . . . 

P(Yt^ = 0\Y to = ■■■ = Y t ^ 2 =X o = Y o = 0)x 

P(X 2N = Y 2N = 0\Y t() = ■ ■ %_ x = X Q = % = 0) 
P(X 2N = Y 2N = 0\X = Y = 0) 

Note that the process {Y n } in isolation is a one-dimensional random walk dis- 
tributed according to the distribution of {Z n } in Lemma 4. Thus, letting t-i = 0, 
we have from (4.12) 
(4.13) 

P(Y t0 =Y tl = --- = Y tj _, = 0|X = Y = X 2N = Y 2N = 0) = 

•••^ .- V " *» () ' x fl P(Z tk = 0\Z tk 1 = 0). 
P(X 2N = Y 2N = 0|X = Y = 0) ^ 

Recall that — tk-i = Sk and Sj = 2N — tj-i. Let s' k = Sfc, if st > 1, and s' fc = 1, 
if s k = 0. Since P(Z tk = 0|Z tfe _ 1 = 0) = P(Z tfe _ tfe _ 1 = 0|Z = 0) it follows from 
Lemma 4 that P(Z tk = 0\Z tk _ 1 = 0) < -7=. From Lemma 6-ii, it follows that 

V s fe 

P(X 2A r = y 2 Ar = 0|X = % = 0) > Using these facts along with (4.11) and 

(4.13) , it follows that if we show that 

(4.14) P(X 2N = Y 2N = 0|Y to = • • -Y t , = X = Y = 0) < —j 



Ns(2N-t j - 1 ) 2 
17 



for some C > and Sj = 2N — tj-i > 1, then we will obtain (4.10) with s r replaced 
by s' r . By increasing the constant c in (4.10), one can always replace s' r by s r . 
Thus, it remains to prove (4.14). 
By the Markov property, we have 

p(x 2N = y 2N = o|f to = • • = x = Y = o) = 

P{X 2N _ tj ^ = Y2N—tj-i = 0\Xo = a,Y = 0) x is(a), 



(4.15) 



aER 



where 



u(a) = P(i 4j _, = a|y t0 = • • = X = Y = 0). 

As in the proof of Lemma 6, let H n denote the number of horizontal steps taken 
by the random walk {X(-),Y(-)} during its first n steps. Let 

Him) = = m\Y t0 = ■ ■ = X = Y = 0), 

and let W be distributed like \i. Then v is distributed like Z w , where {Z n } is a 
simple, symmetric one-dimensional random walk, starting from and independent 
of W. We will show later that for some 7, C > 0, 

//([0,7^-iD = ^ 7*i-i|^ = • • =Xo = % = 0) = 

(4.16) C i 

P(W < 7^-1) < — exp(-7^_ 1 ). 

7 

By Lemma 5-i, it follows that 

C a 2 

(4.17) P{Z n = a\Z = 0) < ° exp(-- ), for 7 t,_ 1 < n < 

V 7*j'-i ^j-i 

From (4.16) and (4.17) we conclude that 

C a 2 C- 7 

(4.18) !/(o) = P(Z W = a) < — =^exp(-- ) + — exp(-7^_i). 

V7*j'-i 2 *j-i 7 

Since z/(a) = 0, if a > it follows from (4.18) that 

(4.19) ,(a)<^Lexp(-^!), 

for some fci, k 2 > 0. From (4.15), (4.19) and Lemma 6-i, we obtain 

P(X 2N = Y 2N = 0\Y to = ■ ■ = X = Y Q = 0) < 

(4-20) _ Cl C2 a 2 fcj fc 2 a 2 . 

5^7--P(-^7T^)7^-P(-7-_7)- 
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For an appropriate C > 0, the right hand side of (4.20) can be bounded from above 

by C j JZo (2jv-t J _ 1 1 ) v /t J _ 1 ex P(~ 2jv-t J 2 -J exp ^~^T^ x - Evaluatin g this integral 
gives the estimate in (4.14). Thus, to complete the proof of the lemma, it remains 

to prove (4.16). 

We mention that it is intuitive that P(H tj _ 1 < jtj-i\Y to = "•Yt j _ 1 = Xq = Yq = 
0) < P(H tj _ 1 < jtj-i) : and this would then give (4.16). The intuition comes from 
the fact that the smaller H t ._ 1 is, the more moves {Y n } makes, and the more moves 
{Y n } makes, the more difficult it is for it to have the prescribed zeroes. However, 
a proof of this is rather complicated and quite tedious. It turns out that a rather 
crude estimate will suffice in order to obtain (4.16). We have 
(4.21) 

M[0,7^-iD = P{H ti _ x < = • • = X = Y = 0) = 

PiHt^ < 7^-1,^0 = • • = X = % = 0) ^ P(^ J _ 1 < 7^-1) 



< 



P(Y t0 = ■ ■ -Y t j _ 1 =X = Y = 0) ~ P(Y t0 = ■ ■ -Yt^ = 0\Y = 0) 

By a standard large deviations estimate, 



(4.22) P(H t _j < 7^-1) < cexp(-/ 7 ^_i), where lim l 1 = log 2. 

"7 — ^0 

(To see this, note that P{H tj _ 1 = 0) = {\) tj ~ 1 = exp(-(log 2)^_i).) By Lemma 



4, we have 



P(Y t0 = ■ ■ = 0\% = 0) = J] P(Y tk = 0|Yi fc _ 1 = 0) > — - C : 



fc=0 



(SqS'i • • • <_i) 2 



1 • 



Since the {s' k } satisfy X]fc=o s 'k — + J> ^ follows that sup{ s /j s'qS^ ■ ■ ■ s^-_j < 
C-^j^Y <eMh-i)- Thus, 

(4.23) P(Y t0 = ■ ■ -Y t ._ x = 0\Y Q = 0) > ^exp(-^ i _ 1 ). 

Now (4.16) follows from (4.21)-(4.23) along with the fact that log 2 > \. □ 
We can now complete the proof of Lemma 2. From (4.2) and (4.10), we have 



2 <(2N + i)^ +1 Yl 

( N ' °r = 2N 
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Let Sj = sj + j - 1. Then (sj)~5 = (^"'(ff )* < -4^- Thus it follows from 
the above inequality that 

(4.24) ^±<(2N + l)hJ +1 p (Mi-Si-i^) - *- 

The replacement of Sj by Sj was made for technical reasons which will become 



clear below. Making the substitutions x r = and x r = for r = 0, j, and 

Xj = 2j^, we rewrite the right hand side of (4.24) as 

(4.25) 



(2N + l)h J+1 jz(2N) 1 ^ J2 (x x 1 ...x J - 1 X J )-2(2N)- j . 

Y J Xr =i 

(2N)x r is a nonnegative integer 

Let C X0)Xli ... )Xj ._ 1 denote the hyper-cube nCofcr, x r + ^] = H^[x r ,x r ]. Con- 
sider UC X0)Xlv .. )Xj l , where the union is over all {xq, ...,xj-i} for which (2N)x r is 



a 



nonnegative integer and X^r=o Xr — This union is contained in V l+ 



j 

2N 

3-1 



{(Vo,Vi, -,2/j-i) -yr>0, Y/ r =oVr < 1 + 2Ar}- We have 
(4.26) 

(x Xi...%_iXj)~2 < (yoyi.-S/j-ii/j) -3 , for all (y , 2/1, 2/j-i) e C' XOiXl) ... )Xj _ 1 , 

where = 1 + ^ - yo - i/i - - - 2/j-i- 
To see that (4.26) holds, note that x r > y r , r = 0, j — 1, for (yo,yi, yj-i) £ 

C'lo^iv^j-i 1 Also, 

■£ _ jSj_ _ gj + j — 1 = gj + j = 2N + j - s - si - ... - gj-i 
J ~ 2iV ~ 2iV 2iV 2iV 

= 1 + ^ - x - xx - ... - Xj -! > 1 + ^ - y - yi - ... - yj_i, 

for (y ,yi, 2/j-i) G C Xo 
In light of these facts it follows that the sum on the right hand side of (4.25) is 

dominated by a certain lower Riemann sum for J s (Ilr=o Vr)~^ dyodyi...dyj-i, 

X +2TV 

where S x = {(yo,yi, ...,2/j-i) ■ y r > 0, Er=o^ < A}. Replacing the sum in (4.25) 
with this integral, and substituting the resulting expression into the right hand side 
of (4.24) gives 

(4.27) ^|!<(2iV + l)^+ 1 ^(2iV)^ i f {f[y r )-^dy dy 1 ...dy j _ 1 . 
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A change of variables shows that 
(4.28) 

pi -pi 

I (Y[yr)~^dy dy 1 ...dy j _ 1 = (1 + ( JJ y r )-?dy dy 1 ...dy j - 1 . 

^ S 1 + J_ r=0 r=0 

+ 2JV 

As is well-known from the theory of Dirichlet distributions, 
(4-29) / (T\y r )-?dy dy 1 ...dy J - 1 = 

From (4.27)-(4.29) we conclude that 



3 + 1 
2 



(4.30) v ,J ' < (2^ + 1)2^+^-2(2^) — (l + ^-)~ | . 

v ; ^ 2JV ^2 - v v 2A r(^) 

The inequality for A(N,j) in Lemma 2 follows from (4.30). It is trivial to check 
that A(N,j) is increasing in A; thus, the inequality for A(k — in Lemma 2 
holds as stated. □ 

Proof of Lemma 3. To prove the lemma we will need the following lemma, which 
complements Lemma 6. 

Lemma 8. Let {X n ,Y n \'^ = Q be a simple, symmetric two-dimensional random walk. 
Let L > 0. There exist constants cl,i, cl >2 > such that for all sufficiently large n 

P((X 2n ,Y 2n ) = (0,0)|(A ,y ) = (2a, 0)) > ^exp(-^^), 

n n 

for a G Z satisfying |a| < L\fn. 

Proof. Let H n , V n be as in the proof of Lemma 6, and let {Z n } be as in Lemma 5. 
We have 

P((X 2n ,Y 2n ) = (0,0)|(A ,y ) = (2a, 0)) = 
(4-31) J2 P(Z 2j = 0\Z = 2a)P(Z 2k = 0\Z = 0)xP(H n = j,V n = k). 

j+k=n 

The proof of the lemma follows easily from (4.31), (4.8) and Lemma 5-ii. □ 

We can now prove Lemma 3. Let {(X n ,Y n )} denote a simple, symmetric two- 
dimensional random walk. Using the notation in the proof of Lemma 2, but with 
k — 1 in place of A, recall that for j = 1, 

P(D SOySl ) = P(Y S0 = 0\X = Y = A 2fc _ 2 = f 2fc _ 2 = 0), 
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and thus from (4.2), 

(4.32) = E P & = 0\X = Y = X 2k _ 2 = Y 2k _ 2 = 0). 

For m satisfying [jk] < m < [|/c], we have 



(4.33) 



P(Y 2m = 0\X = % = X 2k _ 2 = Y 2k _ 2 = 0) > 
[Vk] 

P(Y 2m = 0,X 2m = 2r\X = Y Q = 0)x 

r=-[Vk] 

P(X 2fc _ 2 = y 2fc _ 2 = 0\Xq = Yq = Y 2m = 0,X 2m = 2r) 
P(X 2k _ 2 = Y 2k _ 2 = 0\X = Y = 0) 

We have P(X 2k _ 2 = Y 2k _ 2 = 0\X = Y = Y 2m = 0, X 2m = 2r) = P(X 2k _ 2 _ 2m = 
Y 2k - 2 - 2m = 0\X = 2r, Y = 0). Thus, in light of the above-specified range of 
m and of r, it follows from Lemma 8 and Lemma 6-i that for sufficiently large k, 

P ( X 2 k ~2=Y 2k ^ 2 =0\X =Y =Y 2 =0,X 2m =2r) ^ bounded frQm below by a positive con _ 

P{X 2k _ 2 = Y 2k _ 2 =0\X =Y =0) 

stant. By Lemma 8 and the above-specified bound on m, it also follows that 
P(Y 2rn = 0, X 2m = 2r\Xo = Yq = 0) is bounded from below by ^, for some C > 0. 



Thus, we conclude from (4.33) that for sufficiently large k, 

P(Y 2m = 0\X = Y = X 2k _ 2 = Y 2k _ 2 = 0) > Cifc-*, 
(4.34) 1 3 

for some C\ > and for m satisfying [-k] < m < [~^k]. 

Now (4.32) and (4.34) give 

(2k-2\ z 
\ k-1 I 

for some C 2 > and k sufficiently large. This is clearly equivalent to the lemma. 
□ 

Acknowledgement. The author thanks the referees for their careful reading of the 
paper. He thanks one of them in particular for pointing out an error in a previous 
version of the paper. 

22 



References 



1. Aldous, D. and Diaconis, P., Longest increasing subsequences: from patience sorting to the 
Baik-Deift- Johansson theorem, Bull. Amer. Math. Soc. 36 (1999), 413-432. 

2. Baik, J., Deift, P. and Johansson, K., On the distribution of the length of the longest increasing 
subsequence of random permutations, J. Amer. Math. Soc. 12 (1999), 1119-1178. 

3. Bollobas, B. and Winkler, P., The longest chain among random points in Euclidean space., 
Proc. Amer. Math. Soc. 103 (1988), 347-353. 

4. Durrett, R., Probability: Theory and Examples, Duxbury Press — Wadsworth Pub. Co., 1991. 

5. Lifschitz, V. and Pittel, B., The number of increasing subsequences of the random permuta- 
tion, J. Combin. Theory Ser. A 31 (1981), 1-20. 

6. Logan, B. F. and Shepp, L. A., A variational problem for random Young tableaux, Advances 
in Math. 26 (1977), 206-222. 

7. Sachov, V., Probabilistic Methods in Combinatorial Analysis, Encyclopedia of Mathematics 
and its Applications, vol. 56, Cambridge University Press, 1997. 

8. Vershik, A. M. and Kerov, S. V., Asymptotic behavior of the maximum and generic dimensions 
of irreducible representations of the symmetric group., Functional Anal. Appl. 19 (1985), 21- 
31. 



23 



